Voice Simulation: Factors Affecting Quality And Naturalness

نویسندگان

  • Bayya Yegnanarayana
  • Jayant M. Naik
  • Donald G. Childers
چکیده

In this paper we describe a f lexib le analysls-synthesls system which can be used for a number of studies In speech research. The maln objective Is to have a synthesis system whose characteristics can be controlled through a set of parameters to realize any desired voice characteristics. The basic synthesis scheme consists of two steps: Generation of an excitation signal from pitch and galn contours and excitation of the linear system model described by linear prediction coefficients, We show that a number of basic studies such as time expansion/ compression, pitch modif icat ions and spectral expansion/compression can be made to study the e f fec t of these parameters on the qua l i ty of synthetic speech. A systematic study is made to determine factors responsible for unnaturalness tn synthetic speech. I t i s found that the shape of the g lo t ta l pulse determines the qua l i ty to a large extent. We have also made some studies to determine factors responsible for loss of I n t e l l i g i b i l i t y tn some segments of speech. A signal dependent analysts-synthesis scheme ts proposed to improve the i n t e l l i g i b i l i t y of dynamic sounds such as stops. A simple implementation of the signal dependent analysis is proposed.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Voice Analysis in English and Persian Persuasive Texts: Pedagogical implications in focus

The main purpose of this study is to investigate how voice is realized by Iranian EFL learners in persuasive English and Persian text types. This discourse-related notion is a required criterion for writing acceptable English. However, L2 learners from cultures other than English might face problems in realizing it, or even ignore it all through their writing. In this connection, the present st...

متن کامل

Voice Analysis in English and Persian Persuasive Texts: Pedagogical implications in focus

The main purpose of this study is to investigate how voice is realized by Iranian EFL learners in persuasive English and Persian text types. This discourse-related notion is a required criterion for writing acceptable English. However, L2 learners from cultures other than English might face problems in realizing it, or even ignore it all through their writing. In this connection, the present st...

متن کامل

On-line experimental methods to evaluate text-to-speech (TTS) synthesis: effects of voice gender and signal quality on intelligibility, naturalness and preference

Three experiments are reported that use new experimental methods for the evaluation of text-to-speech (TTS) synthesis from the user’s perspective. Experiment 1, using sentence stimuli, and Experiment 2, using discrete ‘‘call centre’’ word stimuli, investigated the effect of voice gender and signal quality on the intelligibility of three concatenative TTS synthesis systems. Accuracy and search t...

متن کامل

Improvement of prosodic characteristic in Vietnamese speech synthesis system base on HMM

The key factors helping people to understand the synthesized voices of text-to-speech system are the naturalness and the intelligibility. However, making more natural voices remains a difficult task because of the speech data’s scarcity. With data limited corpus, prosodic information such as tone, intonation, Part-of-Speech is added to ensure the quality of synthetic speech. In the paper, we in...

متن کامل

Vocal Disorders and Risk Factors Affecting It: Voice Ergonomics in Teachers

Introduction: Nearly a third of people work in jobs that use voice to be part of their work. Teachers as the largest group of professional vocal users, are at risk of vocal disorders. The aim of this study was to investigate the effect of different risk factors on vocal disorders in teachers.   Material and Methods: This is a cross-sectional and descriptive-analytic study that was conducted on...

متن کامل

Factors affecting perceived quality and intelligibility in the CHATR concatenative speech synthesiser

In order to eliminate trial-and-error in the process of selecting a good speech database as a voice source for concatenative speech synthesis, and to determine the acoustic and prosodic characteristics that best predict `appeal' or perceived `quality' in the synthesised speech, we performed tests to evaluate listener preferences over a range of di erent synthesised voices. We found that variati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1984